A Two-Stage Ensemble of Diverse Models for Advertisement Ranking in KDD Cup 2012
نویسندگان
چکیده
This paper describes the solution of National Taiwan University for track 2 of KDD Cup 2012. Track 2 of KDD Cup 2012 aims to predict the click-through rate of ads on Tencent proprietary search engine. We exploit classification, regression, ranking, and factorization models to utilize a variety of different signatures captured from the dataset. We then blend our individual models to boost the performance through two stages, one on an internal validation set and one on the external test set. Our solution achieves 0.8069 AUC on the public test set and 0.8089 AUC on the private test set.
منابع مشابه
Personalized Ranking for Non-Uniformly Sampled Items
We develop an adapted version of the Bayesian Personalized Ranking (BPR) optimization criterion (Rendle et al., 2009) that takes the non-uniform sampling of negative test items — as in track 2 of the KDD Cup 2011 — into account. Furthermore, we present a modified version of the generic BPR learning algorithm that maximizes the new criterion. We use it to train ranking matrix factorization model...
متن کاملCombining Predictors for Recommending Music: the False Positives' approach to KDD Cup track 2
We describe our solution for the KDD Cup 2011 track 2 challenge. Our solution relies heavily on ensembling together diverse individual models for the prediction task, and achieved a final leaderboard misclassification rate of 3.8863%. This paper provides details on both the modeling and ensemble
متن کاملBayesian Personalized Ranking for Non-Uniformly Sampled Items
In this paper, we describe our approach to track 2 of the KDD Cup 2011. The task was to predict which 3 out of 6 candidate songs were positively rated – instead of not rated at all – by a user. The candidate items were not sampled uniformly, but according to their general popularity. We develop an adapted version of the Bayesian Personalized Ranking (BPR) optimization criterion [9] that takes t...
متن کاملNovel Models and Ensemble Techniques to Discriminate Favorite Items from Unrated Ones for Personalized Music Recommendation
The track 2 problem in KDD Cup 2011 (music recommendation) is to discriminate between music tracks highly rated by a given user from those which are overall highly rated, but not rated by the given user. The training dataset consists of not only user rating history but also the taxonomic information of track, artist, album, and genre. This paper describes the solution of the National Taiwan Uni...
متن کاملFeature Engineering and Ensemble Modeling for Paper Acceptance Rank Prediction
Measuring research impact and ranking academic achievement are important and challenging problems. Having an objective picture of research institution is particularly valuable for students, parents and funding agencies, and also attracts attention from government and industry. KDD Cup 2016 proposes the paper acceptance rank prediction task, in which the participants are asked to rank the import...
متن کامل